DETAILED ACTION 
Claim Rejections - 35 USC § 103 

1. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

2. Claims 23-31, 33-45, 47-91 and 93-100 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over LADD (U.S. Patent 6,269,336). 

As to claim 23, LADD teaches a conversational browser (voice browser), 
comprising: means for interpreting a user command (voice input) and for generating a 
request (content request) to access a CML file (markup language document), wherein 
CML comprises meta-information implementing a conversational dialog for interaction 
with the user in a plurality of user interface modalities including a GUI modality and 
speech modality (via the network access apparatus of the system allows the user to 
access (i.e., view and/or hear) the information retrieved from the information source 
wherein the information is in the form of machine readable data, human readable data, 
audio or speech communications, textual information, graphical or image data, etc. (col. 
3, lines 40-46)) (col. 3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58); and a CML 
processor (parsing unit) for parsing and interpreting a CML file to render the 
conversational dialog in one or more of the plurality of user interface modalities (col. 11, 
lines 25-49; col. 11, line 66 - col. 12, line 24; col. 3, lines 40-46; col. 4, lines 36-43; col. 
4, lines 52-58). It would be obvious to one skilled in the art that browser means for 
interpreting is a voice interface for receiving the voice commands. 

As to claims 24 and 25, LADD teaches a conversational browser (voice browser) 
of a computing device that provides a conversational user interface to render a 
conversational dialog (col. 11, lines 25-49). LADD also teaches that variations and 
modifications may be practiced on the system (col. 2, lines 10-14). However, LADD 
does not teach that the browser executes on top of an operating platform. Official 
Notice is taken in that it is well known in the art that a browser executes on a virtual 
machine to send and handle remote requests and therefore would be obvious in view of 
LADD in order to send and handle voice requests. 

As to claims 26-29, LADD teaches a dialog manager (VRU server / interpreter 
unit) for managing and controlling the conversational dialog wherein the dialog manager 
allocates conversational engines (text to speech unit / automatic speech recognition 
unit) for rendering the conversational dialog by meta-information of a CML file (col. 9, 
lines 1-53; col. 13, lines 41-60). 

As to claims 30, 31, 33 and 34, LADD teaches the user input command (voice 
input) can be input in the one or more user interface modalities (col. 11, lines 31-35; col. 
3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58; col. 2, lines 48-66), the CML is 
implemented in a declarative format encapsulating multi-modal dialog (col. 16, lines 
5-56). Official Notice is taken in that it is well known in the art that XML is a markup 
language and therefore would be obvious that the markup language of LADD is XML. 

As to claims 35-38, LADD teaches the input commands to the browser are voice 
commands (col. 11, lines 26-36). Therefore, it would be obvious to one skilled in the art 
that, since the commands are voice commands that navigate to a web page, the 
browser implements a "what you hear is what you can say", a "say what you heard", a 
"say what you will hear", and a "mixed initiative" dialog format. 



As to claim 80, LADD teaches a method for accessing information, comprising 
the steps of: processing an input command (voice input) with at least one of a plurality 
of conversational engines (network fetcher); generating a request (content request) 
based on the processed input command (voice input) to access a CML file (markup 
language document) from a content server (mark up language server), the CML file 
comprising meta-information to implement a conversational dialog in a plurality of user 
interface modalities including a GUI modality and speech modality (via the network 
access apparatus of the system allows the user to access (i.e., view and/or hear) the 
information retrieved from the information source wherein the information is in the form 
of machine readable data, human readable data, audio or speech communications, 
textual information, graphical or image data, etc. (col. 3, lines 40-46)) (col. 3, lines 40-46; 
col. 4, lines 36-43; col. 4, lines 52-58); transmitting the request (content request) and 
accessing the requested CML file from a content server using a standard networking 
protocol; and processing the meta-information comprising the CML file to render the 
conversational dialog in one or more of the plurality of user interface modalities (via 
parsing the information and executing the file using the browser to display and/or play 
sound) (col. 11, lines 25-49; col. 11, line 66 - col. 12, line 25; col. 14, lines 3-17; col. 2, 
lines 20-39; col. 2, line 59 - col. 3, line 5). 

As to claims 81 and 82, LADD teaches that a conversational browser (voice browser) 
of a computing device executes the steps (col. 11, lines 25-49). LADD also teaches 
that variations and modifications may be practiced on the system (col. 2, lines 10-14). 
However, LADD does not teach that the browser executes on top of an operating 
platform. Official Notice is taken in that it is well known in the art that a browser 
executes on a virtual machine to send and handle remote requests and therefore would 
be obvious in view of LADD in order to send and handle voice requests. 

As to claims 84 and 85, LADD teaches customizing the CML file (markup 
language document) based on the conversational capabilities of the browser (the 
structure of the language can be designed specifically for voice applications); and 
registering the capabilities with the content server (via storing the files on markup 
language servers) (col. 15, line 60 - col. 16, line 21). 

As to claim 83, LADD teaches the steps are distributed using a conversational 
engine (text to speech unit / automatic speech recognition unit) and conversational 
arguments (request data / document attributes) (col. 11, lines 25-49; col. 9, lines 1-53; 
col. 13, lines 41-60). 

As to claims 86-88, LADD teaches transcoding legacy content of the content 
server (information from the information sources) into CML based on predefined 
transcoding rules (via the parser unit) (col. 12, lines 15-24; col. 5, lines 8-11). 

As to claim 89, LADD teaches processing the meta-information comprises 
playing back an audio file or generating synthesized speech output (col. 4, lines 50-61). 

As to claims 90, 91 and 93, LADD teaches the CML is implemented in a 
declarative format encapsulating multi-modal dialog (col. 16, lines 5-56). Official Notice 
is taken in that it is well known in the art that XML is a markup language and therefore 
would be obvious that the markup language of LADD is XML. 

As to claims 94-100, LADD teaches the CML (via markup language document) 
comprises one of (1) a top level element that groups other CML elements; (2) an 
element that specifies output to be spoken to the user; (3) a menu element for 
encapsulating a menu that presents the user with a list of choices wherein each choice 
is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that 
allows the user to input at least one item of information and transmit the at least one 
item of information to a target address; and (5) a combination thereof (col. 16, line 29 - 
col. 17, line 49). 

As to claim 39, LADD teaches a system for accessing information (information), 
comprising: a content server (mark up language server) comprising content pages 
(mark up language documents), wherein the content pages are implemented using a 
CML (mark up language) to describe a conversational dialog for interaction with a user 
in a plurality of user interface modalities (view and audio) including a GUI modality and 
speech modality (via the network access apparatus of the system allows the user to 
access (i.e., view and/or hear) the information retrieved from the information source 
wherein the information is in the form of machine readable data, human readable data, 
audio or speech communications, textual information, graphical or image data, etc. (col. 
3, lines 40-46)) (col. 15, line 60 - col. 16, line 57; col. 3, lines 40-46; col. 4, lines 36-43; 
col. 4, lines 52-58); and a conversational browser (voice browser) for processing a CML 
page received from the content server to render its conversational dialog in one or more 
of the plurality of user interface modalities (col. 11, lines 25-49; col. 11, line 66 - col. 12, 
line 24; col. 3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58). However, LADD does 
not teach that the browser executes on top of an operating platform. Official Notice is 
taken in that it is well known in the art that a browser executes on a virtual machine to 
send and handle remote requests and therefore would be obvious in view of LADD in 
order to send and handle voice requests. 

As to claims 40-44, LADD teaches the system comprises an IVR system 
implemented in CML (system capable of handling a voice markup language document) 
(col. 11, lines 25-49; col. 14, lines 3-17) and accessible over a packet-switched network 
using a standard network protocol (col. 2, lines 26-39). 

As to claims 45 and 47-51, LADD teaches the CML is implemented in a 
declarative format encapsulating multi-modal and speech dialog (col. 16, lines 5-56; col. 
16, line 58 - col. 17, line 49). Official Notice is taken in that it is well known in the art 
that XML is a markup language and therefore would be obvious that the markup 
language of LADD is XML. 

As to claims 52-54, LADD teaches a conversational browser (voice browser) on a 
computing device communicating over a communications network (col. 11, lines 25-49). 
LADD also teaches that variations and modifications may be practiced on the system 
(col. 2, lines 10-14). However, LADD does not teach that the browser executes on top 
of a virtual machine. Official Notice is taken in that it is well known in the art that a 
browser executes on a virtual machine to send and handle remote requests and 
therefore would be obvious in view of LADD in order to send and handle voice requests. 

As to claims 55 and 56, LADD teaches standard network protocols are utilized for 
accessing CML content pages from the content server (col. 5, lines 37-62; col. 2, lines 
26-39). 

As to claims 57-62, LADD teaches transcoding legacy content of the content 
server (information from the information sources) into CML based on predefined 
transcoding rules (via the parser unit) (col. 12, lines 15-24; col. 5, lines 8-11). 

As to claims 63-71, LADD teaches CML (via markup language document) 
comprises a plurality of capability-based frames, an active link, a link to conversational 
data files, a link to at least one distributed conversational engine, a link to an audio file 
for playback, a confirmation message tag, TTS markup, scripting language and 
imperative code, and a link to one of a plug-in or an applet for executing a 
conversational task (col. 16, line 29 - col. 17, line 49). 

As to claims 72-79, LADD teaches the CML (via markup language document) 
comprises one of (1) a top level element that groups other CML elements; (2) an 
element that specifies output to be spoken to the user; (3) a menu element for 
encapsulating a menu that presents the user with a list of choices wherein each choice 
is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that 
allows the user to input at least one item of information and transmit the at least one 
item of information to a target address; and (5) a combination thereof (col. 16, line 29 - 
col. 17, line 49). 



Response to Arguments 

3. Applicant's arguments filed December 22, 2005 have been fully considered but 
they are not persuasive. As to claims 23-31, 33-45, 47-91, and 93-100, Applicant 
argues that Ladd does not disclose or suggest conversational browsers or systems for 
processing CML documents which comprise meta-information to enable interaction with 
the user in a plurality of user interface modalities including a GUI and speech modality 
to render the dialog in one or more user interface modalities. Applicant states that Ladd 
merely discloses a system in which a voice browser is capable of processing a speech 
markup file and rendering a speech/audio interface only. The examiner disagrees. 
Ladd states that the network access apparatus of the system allows the user to access 
(i.e., view and/or hear) the information retrieved from the information source wherein the 
information is in the form of machine readable data, human readable data, audio or 
speech communications, textual information, graphical or image data, etc. (col. 3, lines 
40-46). The output can include a speech communication, textual information, and/or 
graphical information (col. 4, lines 50-58). Because the output information is both 
spoken and displayed, there exists a plurality of user interface modalities, i.e. visual and 
audio. Therefore, Ladd teaches the browser system for processing CML documents 
having meta-information (requested information) to enable interaction with the user in a 
plurality of user interface modalities as disclosed in the claims. In addition, through the 
use of the term "one or more", the claimed parsing and interpreting of a CML file or CML 
application may render a dialog in a single user interface modality. Hence, the invention 
conceivably would still be met by the dialog being presented in audio. Therefore, the 
rejection is maintained as detailed above. 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Lewis A. Bullock, Jr. whose telephone number is (571) 
272-3759. The examiner can normally be reached on Monday-Friday, 8:30 a.m. - 5:00 
p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Meng An, can be reached on (571) 272-3756. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 